[57] ABSTRACT

A video communications system for transmitting video data
between a plurality of transmitting nodes and one or more
receiving nodes. The video data at each transmitting node
is scaled and assigned a display location at the receiving
node(s) prior to transmission. The receiving node(s)
simultaneously display video data received from each of the
transmitting nodes. The video communications system
minimizes the use of bandwidth, and uses a simple, inexpensive
and efficient encoding and decoding system.

20 Claims, 5 Drawing Sheets 
MULTIPLE VIDEO SCREEN DISPLAY
SYSTEM 

FIELD OF THE INVENTION 
The present invention relates generally to the field of 
video data communications, and more particularly to video 
data processing and transmission. 

BACKGROUND OF THE INVENTION 

With the present day advancement of high-bandwidth 
communication infrastructure and the widespread accep- 
tance of digital video compression standards, there has been 
an increasing demand for video-based services. Among 
these new and expanding services are long distance 
education, surveillance systems, video-on-demand, interactive
video games and video conferencing. Importantly, these
and other future video-based services will need cost- 
effective and efficient video data processing and transmis- 
sion systems and methods. 

A typical multi-windows display system will display 
multiple video sequences to a video user. The windows 
environment allows the user to simultaneously view several 
video sequences or images originating from several different 
sources. However, prior art multiple window display sys- 
tems have made inefficient use of bandwidth. Moreover, 
prior art multiple window display systems have needed 
complex encoding and decoding systems and methods, 
which are both costly and have significant processing 
delays. 

In the case of digital image transmission applications, 
such as digital television, it has often been necessary to 
compress the image data in order to conserve bandwidth. In 
this regard, a frame of video (i.e., one full screen) may be 
composed of an array of at least 640x480 pixels. A video 
sequence is composed of a series of frames. In order to 
obtain a standard quality video sequence, a frame rate of at 
least 24 frames per second is necessary. To transmit this 
quantity of image data using the available bandwidth, vari- 
ous well known compression techniques have been 
employed. These compression techniques typically take 
advantage of pixel image data repetition, known as spatial
correlation. Spatial correlation occurs when several adjacent
pixels have the same or similar brightness and color values. 
Data compression techniques take advantage of this repeti- 
tion by transmitting the brightness and color data from one 
pixel and transmitting information on the number of follow- 
ing pixels for which the data is identical, or by transmitting 
only the brightness and color data "difference" between 
adjacent pixels. Several video compression standards have 
become widely adopted, including MPEG1, MPEG2, JPEG 
and p×64. However, it should be appreciated that in some
situations compression alone does not reduce bandwidth 
consumption as much as desirable. Therefore, there is a need 
to further reduce bandwidth consumption. 
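
The two techniques described above, run-length coding and
difference coding, can be illustrated with a minimal sketch.
The function names and the sample row of brightness values are
assumptions chosen for illustration only; they are not taken
from any of the named compression standards, which use
considerably more elaborate schemes.

```python
from itertools import groupby


def run_length_encode(pixels):
    """Collapse each run of identical values into a (value, run_length) pair."""
    return [(value, len(list(run))) for value, run in groupby(pixels)]


def difference_encode(pixels):
    """Keep the first value, then only the change from each pixel to the next."""
    if not pixels:
        return []
    return [pixels[0]] + [b - a for a, b in zip(pixels, pixels[1:])]


# Hypothetical row of brightness values showing spatial correlation.
row = [200, 200, 200, 200, 198, 198, 50, 50, 50]

runs = run_length_encode(row)    # value plus count of repeats
deltas = difference_encode(row)  # mostly zeros and small numbers
```

Nine values collapse to three pairs in the run-length form, and
the difference form consists mostly of zeros, which a later
entropy coding stage could represent compactly.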

Prior art multiple window display systems have also 
failed to address the problem of complex and costly 
encoding, decoding and other needed video process systems. 
In this regard, prior art systems do not encode the final 
display location of the video data at the receiving location. 
As a result, the decoding is made more complex, since the 
display information must be re-coded with the proper dis- 
play location. 

The present invention overcomes these and other drawbacks
of prior art systems.

SUMMARY OF THE INVENTION 

The present invention is directed to a video communications
system for transmitting video data from a plurality of
transmitting nodes to one or more receiving nodes. The
transmitting nodes include scaling means for reducing a full
size image to a scaled image, compression means for
compressing the scaled image, and a display location means for
providing a display location address for the scaled image.
The system further includes a combiner means for combining
scaled images from each of the transmitting nodes in
accordance with the display location address, to form a
combined image. The receiving nodes include decompression
means for decompressing the combined image generated
by the combiner means, and display means for displaying
the decompressed combined image comprised of the
scaled images originating from the transmitting nodes.
It is an object of the present invention to provide a video
communications system which prescales a video image prior
to transmission.

It is another object of the present invention to provide a 
video communications system which conserves transmis- 
sion bandwidth. 

It is another object of the present invention to provide a 
video communications system which determines and assigns 
a final display location of the video image prior to encoding 
and transmission to a receiving location. 
It is still another object of the present invention to provide
a video communications system which minimizes the com- 
plexity of the encoding and decoding circuitry. 

It is still another object of the present invention to provide 
a video communications system which is fast, efficient and 
minimizes processing delays.

The above discussed objects, as well as additional objects 
and advantages of the present invention will become more 
readily apparent by reference to the following detailed 
description and the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention may take physical form in certain parts and 
arrangement of parts, a preferred embodiment of which will 
be described in detail in the specification and illustrated in
the accompanying drawings which form a part hereof, and 
wherein: 

FIG. 1 illustrates an example of a video communications 
system according to a preferred embodiment of the present 
invention;

FIG. 2A illustrates a full size picture prior to scaling;
FIG. 2B illustrates the picture in FIG. 2A as scaled to
quarter size; 

FIG. 3 provides a functional block diagram of the encoding
system according to a preferred embodiment of the
present invention;

FIG. 4 shows a functional block diagram of a combiner
according to a preferred embodiment of the present invention;

FIG. 5A illustrates MPEG standard Elementary Stream 
(ES) headers and payloads originating from a plurality of 
transmitting nodes; and 

FIG. 5B illustrates MPEG standard Elementary Stream
(ES) headers and payloads for a combined picture to be
displayed at a receiving node.

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENT 

Referring now to the drawings wherein the showings are
for the purpose of illustrating a preferred embodiment of the
invention only, and not for the purpose of limiting same,
FIG. 1 illustrates an example of a video communications
system 10 in accordance with a preferred embodiment of the
present invention. While the exemplary system in FIG. 1
shows a five node video communications system, any number
of nodes are possible using the present invention.
Moreover, it should be appreciated that there may be more
than one receiving node, and that each receiving node may
also be a transmitting node, and vice versa. In this respect,
each node may both transmit and receive picture data, or
only perform one of the foregoing functions.

It should be understood that while a preferred embodiment
of the present invention will now be described with
reference to a video communications system using the
MPEG2 video compression method, it is contemplated that
the present invention may be used in conjunction with other
video compression methods, including MPEG1, JPEG and
p×64. Moreover, the present invention may also be used with
intraframe, interframe and motion compensated video
compression methods.

Video communications system 10 is generally comprised
of transmitting nodes 1 through 4, a communications network
50, a receiving node 60 and a communications manager 65.
Transmitting nodes 1-4 and receiving node 60 may
take the form of a workstation or a video conferencing
system. The source of the picture data may be a video
camera, a video cassette recorder (VCR), or other suitable
video sources. Transmitting nodes 1 through 4 respectively
include encoders 30 which encode pictures 1 through 4.
Encoders 30 transmit picture data to receiving node 60
through communications network 50. Encoders 30 will be
described in detail below.

Communications network 50 is a communications link for
transferring data between transmitting nodes 1-4 and receiving
node 60. Communications network 50 may take the form
of any suitable communications link. Preferably,
communications network 50 is an Asynchronous Transfer Mode
(ATM) network, in order to obtain the highest data transfer
rate. In an ATM network, the nodes coordinate with each
other to send fixed-size data chunks (i.e., "cells") to fully
utilize the potential bandwidth of the network. ATM interface
rates generally range from 1.5 megabits per second
(mbps) to 620 mbps, which is suitable for carrying voice,
data, and compressed video. It should be understood that
when an ATM network is used, each node will have an ATM
transport for communicating with the ATM network.
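
The cell figures given later in this description for packetizer
42 (a 48 byte payload, a five byte header, and one service data
unit spread over exactly eight cells) imply a fixed per-cell
overhead. The short sketch below works through that arithmetic.
The constant and variable names are assumptions introduced for
illustration only.

```python
# Figures taken from the description of packetizer 42.
ATM_HEADER_BYTES = 5
ATM_PAYLOAD_BYTES = 48
CELLS_PER_SDU = 8

# Total size of one cell on the wire.
cell_bytes = ATM_HEADER_BYTES + ATM_PAYLOAD_BYTES

# Payload capacity and wire size of the cells carrying one SDU.
sdu_payload_bytes = CELLS_PER_SDU * ATM_PAYLOAD_BYTES
sdu_wire_bytes = CELLS_PER_SDU * cell_bytes

# Fraction of transmitted bytes that carry payload rather than header.
payload_fraction = sdu_payload_bytes / sdu_wire_bytes
```

Under these figures roughly nine tenths of each cell carries
payload, so any reduction in the encoded picture data translates
almost directly into fewer cells on the network.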

Receiving node 60 is generally comprised of a combiner
70, a decoder 80 and a video display 90. Combiner 70
receives encoded picture data originating from each of the
transmitting nodes 1-4 and combines them into one combined
picture, as will be described in detail below. Decoder
80 decodes the encoded combined picture data and displays
the combined pictures on display 90. Combiner 70 and
decoder 80 are described in detail below.

Communications manager 65 is a system for establishing
how many pictures will be simultaneously displayed at
receiving node 60. In addition, communications manager 65
establishes the size and display location of the pictures
simultaneously displayed at receiving node 60.
Communications manager 65 may receive information relating to a
communications session (including a specified picture size
and picture display location) from either a user or from a
scheduler. It should be appreciated that communications
manager 65 may be located at the receiving node or may be
a shared resource on the network, as shown in FIG. 1.
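
As a rough illustration of the role of communications manager
65, the sketch below assigns each of up to four transmitting
nodes a quarter-screen size and a display location. The
`SessionParams` class, the `assign_quadrants` function and the
pixel coordinates are hypothetical; the description does not
specify how the manager represents these parameters. The
column-by-column ordering (pictures 1 and 2 on the left,
pictures 3 and 4 on the right) is chosen to agree with the
slice read order described for FIG. 5B.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SessionParams:
    """Size and display location handed to one transmitting node."""
    node_id: int
    width: int
    height: int
    x: int
    y: int


def assign_quadrants(node_ids, frame_width, frame_height):
    """Give each node one quarter of the frame, filling column by column."""
    if len(node_ids) > 4:
        raise ValueError("a quartered display holds at most four pictures")
    w, h = frame_width // 2, frame_height // 2
    # Top-left, bottom-left, top-right, bottom-right.
    origins = [(0, 0), (0, h), (w, 0), (w, h)]
    return [SessionParams(n, w, h, x, y)
            for n, (x, y) in zip(node_ids, origins)]


params = assign_quadrants([1, 2, 3, 4], 720, 480)
```

Each entry in `params` stands in for the picture size and
location information that controller 20 of the corresponding
transmitting node would receive.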

A detailed description of encoder 30 will now be provided
with reference to FIG. 3. Encoder 30 is generally comprised
of a controller 20, an analog-to-digital (A/D) converter 32,
a picture scaler 34, a compression coding device 36, a
sequence context information inserter 38, a buffer 40, and a
packetizer 42.

Controller 20 provides overall control of encoder 30, and
receives the picture size and location information from
communications manager 65. A/D converter 32 converts
analog "full size" picture data into digital picture data. It
should be understood that the term "full size" as used herein
refers to unscaled picture data which may fill a full screen or
fill less than a full screen.

Picture scaler 34 reduces the full size picture to the
"scaled" picture size specified by controller 20. The operation
of picture scaler 34 will now be described with reference
to FIGS. 2A and 2B. Pictures 104 and 106 are composed
of macroblocks (MB) 102, which in turn are
composed of one or more 8x8 pixel blocks. A set of
consecutive macroblocks 102 is known as a "slice." The
shortest slice is one macroblock, while the longest slice is
the maximum number of macroblocks in a row of a frame.
The number of 8x8 pixel blocks and the structure of the 8x8
pixel blocks in a macroblock will vary depending upon the
chosen video compression standard (e.g., MPEG formats
4:2:0, 4:2:2 and 4:4:4). FIG. 2A shows "full size" (i.e., full
resolution) picture 104 as it fills a frame 100, which is one
full screen. FIG. 2B shows quarter-size picture 106 as it fills
only a quarter of frame 100. It can be seen that the scaled
picture 106 consists of fewer macroblocks and slices than
full size picture 104. Only the shaded macroblocks in FIG.
2B require encoding. As noted above, a "full size" picture
need not fill one full screen, but instead may fill only a
portion of a screen.

Scaling the pictures reduces the total number of macrob- 
locks that are required to be compressed. When the pictures 
are scaled, a lower output data rate of the encoder will be 
achieved, which will ultimately save transmission band- 
width in the system proportionate to the size of the reduc- 
tion. In the example shown in FIG. 2A, full size picture 104 
requires 30 rows with 45 macroblocks per row. Accordingly, 
the total macroblock requirement to send "full size" picture 
104 is 1,350 macroblocks. When the picture is reduced to 
one-fourth size (FIG. 2B), the number of macroblocks is 
reduced to 15 rows with 22 macroblocks per row, for a total 
of 330 macroblocks. 
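
The macroblock counts quoted above can be reproduced with a few
lines of arithmetic. The sketch assumes that quarter-size scaling
halves each dimension and that a half macroblock is dropped (45
halves to 22, not 22.5); the function name is illustrative only.

```python
def macroblock_count(per_row, rows):
    """Total macroblocks in a picture of the given macroblock dimensions."""
    return per_row * rows


# Full size picture 104: 30 rows of 45 macroblocks.
full = macroblock_count(45, 30)

# Quarter size picture 106: each dimension halved, rounding down.
quarter = macroblock_count(45 // 2, 30 // 2)

# Share of the original macroblocks that still need encoding.
ratio = quarter / full
```

Because 45 is odd, the scaled picture holds slightly fewer than
one quarter of the original macroblocks (330 of 1,350).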

Compression coding device 36 codes the "scaled" picture 
data using a video compression method, such as MPEG2. 
Compression coding device 36 is preferably a chip or chip 
set operable to compress macroblocks according to the 
MPEG2 standard. 

Sequence context information inserter 38 inserts sequence 
context information into the MPEG2 encoded picture data. 
The sequence context information includes picture size 
information, picture location information, as well as other 
coding parameters used in the chosen video compression 
method. The sequence context information will be used by 
combiner 70 to generate a "combined" picture, as will be 
explained below. 

It should be understood that in accordance with the 
MPEG2 video compression standard, data streams are trans- 
mitted over an ATM network using layered data structures. 
In this regard, MPEG2 coded video data is formatted in 
encoded macroblocks which are transmitted with a macrob- 
lock marker containing the other information needed for an 
MPEG2 compatible decoder. Macroblocks are packed into 
slices which are formatted into an elementary stream (ES). 
An ES may hold partial pictures, complete pictures (i.e., a 
frame) or a group of pictures. Compression coding device
36, together with sequence context information inserter 38,
will generate elementary streams (ES) comprising picture
data and sequence context information.

Buffer 40 stores the encoded picture data while it awaits
transmission to the receiving node. Buffering may be
necessary for proper data transmission. Packetizer 42 packetizes
the encoded picture data prior to transmission. In this regard,
packetizer 42 formats the elementary streams (ES) into
packetized elementary streams (PES), and formats the
packetized elementary streams (PES) into transport streams (TS).
The transport streams (TS) include program information
data (PID) which identifies the source of the picture data
(i.e., the transmitting node sending the elementary streams).
Accordingly, the PID allows combiner 70 to identify the
source of the different pictures it receives for combination.
In addition, packetizer 42 formats the TS into service data
units (SDU), which are arranged as the payload of ATM
cells. In this respect, the SDU will fit into the payload of
exactly eight ATM cells. Each ATM cell has a payload of 48
bytes and a header of five bytes for a total of 53 bytes per
cell. The ATM cells identify the destination address of the
elementary streams, which will be the location of combiner
70. In the present example, this location will be receiving
node 60. If there are several receiving nodes (each having a
combiner 70), the ATM cells will identify several addresses.

Combiner 70 will receive ATM cells from several locations.
As noted above, each TS cell has different PIDs, which
allow combiner 70 to determine the source of the picture
data, and thus separate the different MPEG video channels.
Combiner 70 sorts incoming TS cells into appropriate
memory locations and extracts the ES payloads and necessary
header information, as will be described in detail below.

Operation of encoder 30 will now be described in detail.
Communications manager 65 receives communication session
information from a user or a scheduler. From this
information, communications manager 65 sets up a
communications session by determining how many nodes are
connected to the communications session. In addition,
communications manager 65 establishes the video session
parameters, which include picture size and the display
location for each picture to be displayed at the receiving
node(s). Communications manager 65 provides video session
parameters to controller 20 of each transmitting node
involved in the communications session. In the example
shown in FIG. 1, there are four transmitting nodes, therefore
video display 90 at receiving node 60 may be divided into
quarters to simultaneously display pictures from four
different transmitting nodes.

Each transmitting node generates full size (i.e., full
resolution) picture data (e.g., 720x480, 720x575, 640x480
or other typical picture size). This full size picture data is
applied to A/D converter 32 which converts the picture data
to digital data. Picture scaler 34 then reduces the full size
picture data in accordance with the picture size specified by
controller 20. In the example shown in FIG. 1, the full size
picture data is reduced to one-quarter size. As noted above,
reducing the picture size reduces the transmission bandwidth
required to transmit the picture data. The reduction in
bandwidth requirements is accomplished because the
reduced size picture data requires fewer macroblocks, as
explained above in connection with FIGS. 2A and 2B.

After picture scaler 34 has generated scaled picture data,
compression coding device 36 compresses the scaled picture
data in accordance with a video compression method.
Compression coding device 36, along with sequence context
information inserter 38, will generate elementary streams
(ES). In the example shown in FIG. 1, an elementary stream
based on I-frame coded picture (4:2:2) with a slice length of
22 macroblocks is generated. Compression coding device 36
may also add picture data to define a border around the
scaled picture, or to provide a background for the
combined picture.

The elementary streams (ES) are comprised of an ES
header and an ES payload. The ES header includes a
reference display location. The ES payload contains
macroblock headers and macroblocks which form slices. The
macroblock headers include display location information for
the respective macroblocks, as well as other macroblock
information. The display location information in
the macroblock header defines a relative display location. In
this regard, the macroblock header for the first slice will
specify a display address relative to the reference display
location specified in the ES header. Subsequent macroblock
headers will specify a display address relative to the
location of the previous slice.

After the elementary streams have been generated, the
compressed picture data is stored in buffer 40. Packetizer 42
will format the macroblocks into packets appropriate for
transmission over communications network 50. It should be
appreciated that where communications network 50 takes
the form of an ATM network, each cell will be assigned
a destination address identifying the destination of the
elementary streams.

Combiner 70 will now be described in detail with reference
to FIG. 4. Combiner 70 is generally comprised of a
de-packetizer 72, a sequence context information reader 74,
a memory 76, a sequence context information inserter 78
and a controller 110. Controller 110 provides overall control
of combiner 70. De-packetizer 72 de-packetizes the encoded
picture data received from the transmitting nodes. Sequence
context information reader 74 reads the sequence context
information inserted into the encoded picture data. As
indicated above, the sequence context information includes
picture size information, picture location information, and
other coding parameters. Memory 76 stores the encoded
picture data from each of the transmitting nodes. Sequence
context information inserter 78 inserts the appropriate
sequence context information into the ordered encoded
picture data. This sequence context information is inserted
into an ES header 220 for the combined pictures, and
specifies decoding information such as frame rate, aspect
ratio, size, and display location for the combined picture.

It should be appreciated that while combiner 70 has been
shown as a part of receiving node 60, combiner 70 may be
arranged separate from the receiving node and provided as
a shared network resource. Where combiner 70 is arranged
as a shared resource, it will also generate a new TS and ATM
cell specifying the address(es) of the receiving node(s).

Operation of combiner 70 will now be described in detail
with reference to FIG. 4 and FIGS. 5A and 5B. Encoded
picture data is received by de-packetizer 72. The
de-packetized data is then read by sequence context information
reader 74. This allows controller 110 to analyze the sequence
context information associated with the picture data. It
should be appreciated that controller 110 may receive picture
size and display location information from communications
manager 65. Communications manager 65 provides
controller 110 with the number of transmitting nodes
involved in the video communications session, the picture
sizes and display locations. This information is used by
controller 110 to store picture data in the appropriate
memory location in memory 76.

As discussed above, MPEG coded picture data is formatted
in slices composed of macroblocks. The slices are
formatted into elementary streams (ES). Each elementary
stream is comprised of an ES header and an ES payload.
FIG. 5A illustrates the respective ES headers 120A-120D
and the respective ES payloads 130A-130D for transmitting
nodes 1-4. As can be seen, each ES payload consists of slices
134 for the respective scaled picture and macroblock
headers 132.

Controller 110 writes into memory 76 the ES payloads
130A-130D for each incoming picture. Memory locations
labelled "picture 1" store N slices corresponding to picture
1 from transmitting node 1. Likewise, memory locations
labelled "picture 2," "picture 3" and "picture 4" respectively
store N slices corresponding to pictures 2, 3 and 4 from
transmitting nodes 2, 3 and 4. Next, controller 110 reads the
slices out of memory 76 to form a "combined" picture
consisting of a plurality of scaled pictures from different
transmitting nodes. The slices are read out of memory 76 in
a specified order.

FIG. 5B shows ES header 220 and ES payload 230 for the
combined picture. In the present example, ES payload 230
is generated by controller 110 reading from memory 76 slice
1 of picture 1, then reading slice 1 of picture 3. Next, slice
2 of picture 1 and slice 2 of picture 3 are read out of memory
76. This process continues until no more slices are available
from this frame of pictures 1 and 3. Next, controller 110 reads
out of memory 76 slice 1 of picture 2, and then slice 1 of
picture 4. Next, slice 2 of picture 2 and slice 2 of picture 4
are read out of memory 76. This process continues until all
the slices from pictures 2 and 4 have been read. In the
present example, two slices are provided per row of
macroblocks.
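
The read order described for FIG. 5B amounts to interleaving
the slices of the pictures that share a row of the display. The
following is a minimal sketch of that ordering, not the
combiner's actual circuitry. The `combine_slices` function, the
dictionary standing in for memory 76, and the slice labels are
assumptions for illustration, and each picture is assumed to
hold the same number of slices.

```python
def combine_slices(memory, layout):
    """Interleave stored slices into the order of one combined picture.

    memory: maps a picture number to its slices, top to bottom.
    layout: rows of the display, each listing picture numbers left to right.
    """
    combined = []
    for display_row in layout:
        # Take slice i of every picture in this display row before slice i+1.
        for slices in zip(*(memory[p] for p in display_row)):
            combined.extend(slices)
    return combined


# Three slices per picture, labelled by picture and slice number.
memory = {p: [f"P{p}-S{s}" for s in (1, 2, 3)] for p in (1, 2, 3, 4)}

# Pictures 1 and 3 form the top of the display, 2 and 4 the bottom.
layout = [[1, 3], [2, 4]]

combined = combine_slices(memory, layout)
```

The slices themselves are copied unchanged; only their order in
the output differs from the order in which they were stored.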

It should be appreciated that combiner 70 in no way alters
the macroblock display location addresses in macroblock
headers 132, but rather merely reorders the slices as per the
display location address assigned by encoders 30.

It should be noted that in the foregoing process of
generating ES header 220 the contents of macroblock
headers 132 are not changed. However, if desired, the contents of
macroblock headers 132 could be modified prior to generation
of ES payload 230. It should also be appreciated that if
simultaneous display of the combined scaled pictures does
not fill a full screen frame, and it is desired to fill a full screen
frame, a border or the like may be inserted to fill the empty
space.

Once ES payload 230 for the combined picture has been
constructed, ES header 220 is attached, and the combined
picture is ready for decoding by decoder 80. In this regard,
sequence context information inserter 78 provides sequence
context information in ES header 220 which relates to the
"combined" picture stored in ES payload 230. For instance,
the sequence context information may specify the display
location of the combined picture. This is particularly
important where the combined picture fills less than a full screen.

Decoder 80 decodes the macroblocks as if they form a
single picture. Decoder 80 then provides video information
to video display 90 for displaying the combined picture
comprised of a plurality of reduced-size pictures. The
combined picture may fill the full screen or it may fill less than
a full screen. Decoder 80 is preferably a chip or chip set
operable to decompress macroblocks according to the
MPEG2 or MPEG1 standard. It should be appreciated that
decoder 80 may be configured to decode a fixed size picture.
Accordingly, decoder 80 is unaware that the combined
picture actually consists of a plurality of scaled pictures
from multiple sources. It should also be noted that decoder
80 does not require any context switchable features since all
the context information (e.g., picture size and picture
location) for each picture is transmitted as a part of the
headers.

Additional post-processing may take place after decoding 
to further reduce or relocate each picture. 

The foregoing description is a specific embodiment of the 
present invention. It should be appreciated that this embodi- 
ment is described for purposes of illustration only, and that 
numerous alterations and modifications may be practiced by 
those skilled in the art without departing from the spirit and 
scope of the invention. It is intended that all such modifi- 
cations and alterations be included insofar as they come 
within the scope of the invention as claimed or equivalents 
thereof. 

Having described the invention, the following is claimed: 

1. A video communications system comprising: 

a plurality of video transmission nodes and one or more 
receiving nodes, said transmission nodes including: 
scaling means for generating scaled picture data from
full size picture data, said scaled picture data defin- 
ing a reduced size picture, 
compression means for compressing the scaled picture 
data, and 

display information insertion means for inserting final 
display location and size data in a header attached to 
the compressed scaled picture data, the display loca- 
tion and size data defining the display location for 
the scaled picture data and size of the scaled picture 
data to be displayed at the one or more receiving 
nodes; and 
said receiving nodes comprising: 

display information reader means for reading display 
location and size data from the header attached to 
said compressed scaled picture data from each of 
said plurality of video transmission nodes transmit- 
ting scaled picture data; 

combiner means for combining compressed scaled pic- 
ture data from the plurality of video transmission 
nodes transmitting compressed scaled picture data 
into combined picture data defining a combined 
picture comprised of a plurality of reduced size 
pictures, said combiner means combining the com- 
pressed scaled picture data by writing the com- 
pressed scaled picture data into specified memory 
locations in accordance with the display location and 
size data read from the header attached to the com- 
pressed scaled picture data from each of said plural- 
ity of video transmission nodes transmitting scaled 
picture data, and reading the compressed picture data 
from the specified memory locations in a specific 
order to form a combined compressed picture, and 

decompression means for decompressing the combined 
compressed picture data; 

display means for displaying the decompressed com- 
bined picture data in accordance with the display 
location and size data; and 
a communications medium for connecting the plurality of 

video transmission nodes to the one or more video 

receiving nodes. 

2. A system as defined in claim 1, wherein said compres- 
sion means is an MPEG2 standard compatible device, and 
said decompression means is an MPEG2 standard compat- 
ible device. 

3. A system as defined in claim 1, wherein said commu- 
nications medium is an asynchronous transfer mode (ATM) 
network. 
4. A system as defined in claim 1, wherein one or more of
the video transmitting nodes is also a video receiving node. 

5. A system as defined in claim 1, wherein said combined 
picture is a single full size picture. 

6. A video communications system comprising:
a plurality of video transmitting nodes, at least one of said 

transmitting nodes having scaling means for reducing a 
full size picture defined by full size picture data to a 
reduced size picture defined by scaled picture data; 
compression means for compressing the scaled picture
data, 

display information insertion means for inserting final 
display location and size data in a header attached to the 
compressed scaled picture data;

combiner means for combining the compressed scaled 
picture data from said plurality of video transmitting 
nodes into a compressed combined picture data, the 
combined picture data defining a combined picture 
comprised of a plurality of reduced size pictures, said
combiner means combining the compressed scaled 
picture data by writing the compressed scaled picture 
data into specified memory locations in accordance 
with the display location and size data read from the 
header attached to the compressed scaled picture data
from each of said plurality of video transmission nodes 
transmitting scaled picture data, and reading the com- 
pressed picture data from the specified memory loca- 
tions in a specific order to form a combined compressed 
picture,

one or more video receiving nodes for receiving the 
combined picture data, the receiving nodes including 
display means for displaying the combined picture 
data; 

communication means for communicating to said display
information insertion means a desired final display 
location and size of said reduced size picture on said 
display means for displaying said reduced size picture 
for each of said plurality of reduced size pictures; and 

communications link means for connecting the plurality
of video transmitting nodes to the one or more video 
receiving nodes. 

7. A system as defined in claim 6, wherein said transmitting
nodes further comprise compression means for compressing
the scaled picture data, and said receiving nodes
further comprise decompression means for decompressing 
the combined picture data. 

8. A system as defined in claim 7, wherein said compres- 
sion means is an MPEG2 standard compatible device, and 
said decompression means is an MPEG2 standard compatible
device.

9. A system as defined in claim 6, wherein said transmitting
nodes further comprise means for providing display
location and size data to said scaled picture data, said
combiner means combining the scaled picture data in accordance
with the display location and size data.

10. A system as defined in claim 6, wherein one or more 
of the video transmitting nodes is also a video receiving 
node. 

11. A system as defined in claim 6, wherein said communications
link means is an asynchronous transfer mode
(ATM) network. 

12. A system as defined in claim 6, wherein said combined 
picture is a single full size picture. 



13. A method for communicating pictures originating 
from a plurality of transmitting nodes to one or more 
receiving nodes, the method comprising: 

inputting full size picture data defining a full size picture; 

scaling the full size picture data at each transmitting node 
to generate scaled picture data defining a reduced size 
version of the full size picture; 

compressing the scaled picture data generated at each 
transmitting node; 

inserting final display location and size data in a header 
attached to the compressed scaled picture data, the 
display location and size data defining the display 
location for the scaled picture data and size of the 
scaled picture data to be displayed at the one or more 
receiving nodes; 

transmitting the compressed scaled picture data from each 
of the transmitting nodes to a combiner means, wherein 
said combiner means combines the compressed scaled 
picture data from each transmitting node into combined 
picture data, said combiner means combining the com- 
pressed scaled picture data by writing the compressed 
scaled picture data into specified memory locations in 
accordance with the display location and size data read 
from the header attached to the compressed scaled 
picture data from each of said plurality of video trans- 
mission nodes transmitting scaled picture data, and 
reading the compressed picture data from the specified 
memory locations in a specific order to form a com- 
bined compressed picture; 

decompressing the combined picture data at one or more 
of the receiving nodes; and 

displaying the decompressed combined picture data. 

14. A method as defined in claim 13, wherein said 
combined picture data defines a single full size picture. 

15. A video communication system as defined in claim 1, 
wherein said display information means is a sequence con- 
text information inserter. 

16. A video communication system as defined in claim 1, 
wherein said display information means is a controller. 

17. A video communication system as defined in claim 1, 
wherein said display information means is a communication
manager.

18. A video communication system as defined in claim 17, 
wherein said communication manager determines the size of 
said scaled picture data based on the number of video 
transmission nodes transmitting scaled picture data, and said 
communication manager communicates this size to said 
scaling means of each of said video transmission nodes that 
are transmitting scaled picture data. 

19. A video communication system as defined in claim 17, 
wherein said communication manager determines and communicates
a display location on said display means for each
scaled picture of each of said plurality of said video trans- 
mission nodes transmitting scaled picture data, each display 
location being different from each other display location, 
such that none of the scaled pictures overlap on said display 
means. 

20. A video communication system as defined in claim 1, 
and further including a second display information means 
for adding size and display location data to the combined 
picture data. 
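
The layout and combining steps recited in claims 6, 13, 18 and 19 can be illustrated with a minimal sketch. It is not the claimed implementation: the Header fields, the function names, and the choice of a uniform grid are assumptions made for illustration only, and each compressed picture is treated as an opaque byte string rather than as MPEG2 data. The sketch shows a communication manager choosing one reduced size from the number of transmitting nodes, giving each node a distinct display location, and a combiner writing each payload to a location taken from its header and then reading the locations back in a fixed order.

```python
import math
from dataclasses import dataclass


@dataclass
class Header:
    """Hypothetical header: final display location and size, in pixels."""
    x: int
    y: int
    width: int
    height: int


def assign_layout(num_nodes, full_width, full_height):
    """Choose one reduced size from the node count and a distinct cell per node.

    A uniform grid is an assumption; the claims only require that the
    display locations differ and that the scaled pictures do not overlap.
    """
    if num_nodes < 1:
        raise ValueError("need at least one transmitting node")
    cols = math.isqrt(num_nodes - 1) + 1   # ceiling of the square root
    rows = -(-num_nodes // cols)           # ceiling division
    w, h = full_width // cols, full_height // rows
    return [Header((i % cols) * w, (i // cols) * h, w, h)
            for i in range(num_nodes)]


def overlaps(a, b):
    """True if the two display rectangles share any area."""
    return (a.x < b.x + b.width and b.x < a.x + a.width
            and a.y < b.y + b.height and b.y < a.y + a.height)


def combine(packets):
    """Write each payload at the location named in its header, then read
    the locations back top-to-bottom, left-to-right."""
    memory = {}
    for header, payload in packets:
        memory[(header.y, header.x)] = payload
    return [memory[key] for key in sorted(memory)]


if __name__ == "__main__":
    layout = assign_layout(4, 704, 480)
    # Payloads may arrive in any order; the header decides placement.
    arrivals = [(layout[3], b"D"), (layout[0], b"A"),
                (layout[2], b"C"), (layout[1], b"B")]
    print(combine(arrivals))
```

Keying the memory by display location rather than by arrival order is what lets the combiner form the combined picture without decompressing the individual streams, which is the property the claims rely on.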






